scalable model
Scalable spatial point process models for forensic footwear analysis
Manna, Alokesh, Spencer, Neil, Dey, Dipak K.
Shoe print evidence recovered from crime scenes plays a key role in forensic investigations. By examining shoe prints, investigators can determine details of the footwear worn by suspects. However, establishing that a suspect's shoes match the make and model of a crime scene print may not be sufficient. Typically, thousands of shoes of the same size, make, and model are manufactured, any of which could be responsible for the print. Accordingly, a popular approach used by investigators is to examine the print for signs of ``accidentals,'' i.e., cuts, scrapes, and other features that accumulate on shoe soles after purchase due to wear. While some patterns of accidentals are common on certain types of shoes, others are highly distinctive, potentially distinguishing the suspect's shoe from all others. Quantifying the rarity of a pattern is thus essential to accurately measuring the strength of forensic evidence. In this study, we address this task by developing a hierarchical Bayesian model. Our improvement over existing methods primarily stems from two advancements. First, we frame our approach in terms of a latent Gaussian model, thus enabling inference to be efficiently scaled to large collections of annotated shoe prints via integrated nested Laplace approximations. Second, we incorporate spatially varying coefficients to model the relationship between shoes' tread patterns and accidental locations. We demonstrate these improvements through superior performance on held-out data, which enhances accuracy and reliability in forensic shoe print analysis.
- North America > United States > West Virginia (0.04)
- North America > United States > District of Columbia > Washington (0.04)
- North America > United States > Connecticut > Tolland County > Storrs (0.04)
- (2 more...)
Tomography of the London Underground: a Scalable Model for Origin-Destination Data
The paper addresses the classical network tomography problem of inferring local traffic given origin-destination observations. Focussing on large complex public transportation systems, we build a scalable model that exploits input-output information to estimate the unobserved link/station loads and the users path preferences. Based on the reconstruction of the users' travel time distribution, the model is flexible enough to capture possible different path-choice strategies and correlations between users travelling on similar paths at similar times. The corresponding likelihood function is intractable for medium or large-scale networks and we propose two distinct strategies, namely the exact maximum-likelihood inference of an approximate but tractable model and the variational inference of the original intractable model. As an application of our approach, we consider the emblematic case of the London Underground network, where a tap-in/tap-out system tracks the start/exit time and location of all journeys in a day. A set of synthetic simulations and real data provided by Transport For London are used to validate and test the model on the predictions of observable and unobservable quantities.
- Transportation > Passenger (0.99)
- Transportation > Ground > Rail (0.99)
- Transportation > Infrastructure & Services (0.60)
Reviews: Tomography of the London Underground: a Scalable Model for Origin-Destination Data
I thank the authors for the clarification in their rebuttal. It is even more clear that the authors should better contrast their work with aggregate approaches such as Dan Sheldon's collective graphical models (e.g., Sheldon and Dietterich (2011), Kumar et al. 2013, Bernstein and Sheldon 2016). Part of the confusion came from some of the modeling choices: In equation (1) the travel times added by one station is Poisson distributed?! Poisson is often used for link loads (how many people there are in a given station), not to model time. Is the quantization of time too coarse for a continuous-time model? Wouldn't a phase-type distribution(e.g., Erlang) be a better choice for time? Such modeling choices must be explained.
- Transportation > Passenger (0.40)
- Transportation > Ground > Rail (0.40)
Studying Drowsiness Detection Performance while Driving through Scalable Machine Learning Models using Electroencephalography
Rogel, José Manuel Hidalgo, Beltrán, Enrique Tomás Martínez, Pérez, Mario Quiles, Bernal, Sergio López, Pérez, Gregorio Martínez, Celdrán, Alberto Huertas
- Background / Introduction: Driver drowsiness is a significant concern and one of the leading causes of traffic accidents. Advances in cognitive neuroscience and computer science have enabled the detection of drivers' drowsiness using Brain-Computer Interfaces (BCIs) and Machine Learning (ML). However, the literature lacks a comprehensive evaluation of drowsiness detection performance using a heterogeneous set of ML algorithms, and it is necessary to study the performance of scalable ML models suitable for groups of subjects. - Methods: To address these limitations, this work presents an intelligent framework employing BCIs and features based on electroencephalography for detecting drowsiness in driving scenarios. The SEED-VIG dataset is used to evaluate the best-performing models for individual subjects and groups. - Results: Results show that Random Forest (RF) outperformed other models used in the literature, such as Support Vector Machine (SVM), with a 78% f1-score for individual models. Regarding scalable models, RF reached a 79% f1-score, demonstrating the effectiveness of these approaches. This publication highlights the relevance of exploring a diverse set of ML algorithms and scalable approaches suitable for groups of subjects to improve drowsiness detection systems and ultimately reduce the number of accidents caused by driver fatigue. - Conclusions: The lessons learned from this study show that not only SVM but also other models not sufficiently explored in the literature are relevant for drowsiness detection. Additionally, scalable approaches are effective in detecting drowsiness, even when new subjects are evaluated. Thus, the proposed framework presents a novel approach for detecting drowsiness in driving scenarios using BCIs and ML.
- North America > United States (0.28)
- Europe > Switzerland > Zürich > Zürich (0.14)
- Europe > Spain > Region of Murcia > Murcia (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Tomography of the London Underground: a Scalable Model for Origin-Destination Data
Colombo, Nicolò, Silva, Ricardo, Kang, Soong Moon
The paper addresses the classical network tomography problem of inferring local traffic given origin-destination observations. Focussing on large complex public transportation systems, we build a scalable model that exploits input-output information to estimate the unobserved link/station loads and the users path preferences. Based on the reconstruction of the users' travel time distribution, the model is flexible enough to capture possible different path-choice strategies and correlations between users travelling on similar paths at similar times. The corresponding likelihood function is intractable for medium or large-scale networks and we propose two distinct strategies, namely the exact maximum-likelihood inference of an approximate but tractable model and the variational inference of the original intractable model. As an application of our approach, we consider the emblematic case of the London Underground network, where a tap-in/tap-out system tracks the start/exit time and location of all journeys in a day.
- Transportation > Passenger (0.79)
- Transportation > Ground > Rail (0.79)
- Transportation > Infrastructure & Services (0.63)
AI Challenge Problem: Scalable Models for Patterns of Life
We describe how computational POL modeling integrates diverse artificial intelligence research areas and provides interesting challenges in multiple fields. Simultaneously, these patterns of life impose structure on individual decisions. For example, a pattern of rush hour traffic arises from drivers' decisions to commute at a certain time. Knowledge of rush hour influences individuals' departure times. Modeling POL is not only an academic pursuit.
- Law (1.00)
- Information Technology (1.00)
- Transportation > Ground > Road (0.89)
AI Challenge Problem: Scalable Models for Patterns of Life
Folsom-Kovarik, J. T. (Soar Technology, Inc.) | Schatz, Sae (MESH Solutions, LLC, a DSCI Company) | Jones, Randolph M. (Soar Technology, Inc.) | Bartlett, Kathleen (MESH Solutions, LLC, a DSCI Company) | Wray, Robert E. (Soar Technology, Inc.)
AI Challenge Problem: Scalable Models for Patterns of Life
Folsom-Kovarik, J. T. (Soar Technology, Inc.) | Schatz, Sae (MESH Solutions, LLC, a DSCI Company) | Jones, Randolph M. (Soar Technology, Inc.) | Bartlett, Kathleen (MESH Solutions, LLC, a DSCI Company) | Wray, Robert E. (Soar Technology, Inc.)
- North America > United States > Virginia > Arlington County > Arlington (0.04)
- North America > United States > North Carolina > Wake County > Raleigh (0.04)
- North America > United States > New Jersey > Middlesex County > Piscataway (0.04)
- (6 more...)
- Transportation (0.95)
- Information Technology (0.95)
- Law (0.94)
- (2 more...)
Scalable Models for Patterns of Life
Folsom-Kovarik, Jeremiah T. (Soar Technology, Inc.) | Schatz, Sae (MESH Solutions, LLC, a DSCI Company) | Jones, Randolph M. (Soar Technology, Inc.) | Bartlett, Kathleen (MESH Solutions, LLC, a DSCI Company) | Wray, Robert E. (Soar Technology, Inc.)
Patterns of life (POL) are emergent properties of complex social systems. Computational models of POL offer significant potential for practical application and theoretical study, but also important challenges for AI research. Computational POL models must achieve simultaneous scalability along three key dimensions: population size, intelligence, and automatic behavior specification. Three broad research areas that could support important improvements in POL modeling are pattern recognition, representational abstraction, and behavior generation with intelligent agents and the like. This paper describes challenges in POL modeling that AI researchers from many fields can help to meet.